176 research outputs found

    Guidelines For Pursuing and Revealing Data Abstractions

    Full text link
    Many data abstraction types, such as networks or set relationships, remain unfamiliar to data workers beyond the visualization research community. We conduct a survey and series of interviews about how people describe their data, either directly or indirectly. We refer to the latter as latent data abstractions. We conduct a Grounded Theory analysis that (1) interprets the extent to which latent data abstractions exist, (2) reveals the far-reaching effects that the interventionist pursuit of such abstractions can have on data workers, (3) describes why and when data workers may resist such explorations, and (4) suggests how to take advantage of opportunities and mitigate risks through transparency about visualization research perspectives and agendas. We then use the themes and codes discovered in the Grounded Theory analysis to develop guidelines for data abstraction in visualization projects. To continue the discussion, we make our dataset open along with a visual interface for further exploration

    Exploring performance data with boxfish

    Get PDF
    pre-printThe growth in size and complexity of scaling applications and the systems on which they run pose challenges in analyzing and improving their overall performance. With metrics coming from thousands or millions of processes, visualization techniques are necessary to make sense of the increasing amount of data. To aid the process of exploration and understanding, we announce the initial release of Boxfish, an extensible tool for manipulating and visualizing data pertaining to application behavior. Combining and visually presenting data and knowledge from multiple domains, such as the application's communication patterns and the hardware's network configuration and routing policies, can yield the insight necessary to discover the underlying causes of observed behavior. Boxfish allows users to query, filter and project data across these domains to create interactive, linked visualizations

    Preserving Command Line Workflow for a Package Management System Using ASCII DAG Visualization

    Get PDF
    Package managers provide ease of access to applications by removing the time-consuming and sometimes completely prohibitive barrier of successfully building, installing, and maintaining the software for a system. A package dependency contains dependencies between all packages required to build and run the target software. Package management system developers, package maintainers, and users may consult the dependency graph when a simple listing is insufficient for their analyses. However, users working in a remote command line environment must disrupt their workflow to visualize dependency graphs in graphical programs, possibly needing to move files between devices or incur forwarding lag. Such is the case for users of Spack, an open source package management system originally developed to ease the complex builds required by supercomputing environments. To preserve the command line workflow of Spack, we develop an interactive ASCII visualization for its dependency graphs. Through interviews with Spack maintainers, we identify user goals and corresponding visual tasks for dependency graphs. We evaluate the use of our visualization through a command line-centered study, comparing it to the system's two existing approaches. We observe that despite the limitations of the ASCII representation, our visualization is preferred by participants when approached from a command line interface workflow.U.S. Department of Energy by Lawrence Livermore National Laboratory [DE-AC52-07NA27344, LLNL-JRNL-746358]This item from the UA Faculty Publications collection is made available by the University of Arizona with support from the University of Arizona Libraries. If you have questions, please contact us at [email protected]

    CcNav: Understanding Compiler Optimizations in Binary Code

    Full text link
    Program developers spend significant time on optimizing and tuning programs. During this iterative process, they apply optimizations, analyze the resulting code, and modify the compilation until they are satisfied. Understanding what the compiler did with the code is crucial to this process but is very time-consuming and labor-intensive. Users need to navigate through thousands of lines of binary code and correlate it to source code concepts to understand the results of the compilation and to identify optimizations. We present a design study in collaboration with program developers and performance analysts. Our collaborators work with various artifacts related to the program such as binary code, source code, control flow graphs, and call graphs. Through interviews, feedback, and pair-analytics sessions, we analyzed their tasks and workflow. Based on this task analysis and through a human-centric design process, we designed a visual analytics system Compilation Navigator (CcNav) to aid exploration of the effects of compiler optimizations on the program. CcNav provides a streamlined workflow and a unified context that integrates disparate artifacts. CcNav supports consistent interactions across all the artifacts making it easy to correlate binary code with source code concepts. CcNav enables users to navigate and filter large binary code to identify and summarize optimizations such as inlining, vectorization, loop unrolling, and code hoisting. We evaluate CcNav through guided sessions and semi-structured interviews. We reflect on our design process, particularly the immersive elements, and on the transferability of design studies through our experience with a previous design study on program analysis.Comment: IEEE VIS VAST 202

    Alleviating Environmental Health Disparities Through Community Science and Data Integration

    Get PDF
    Environmental contamination is a fundamental determinant of health and well-being, and when the environment is compromised, vulnerabilities are generated. The complex challenges associated with environmental health and food security are influenced by current and emerging political, social, economic, and environmental contexts. To solve these “wicked” dilemmas, disparate public health surveillance efforts are conducted by local, state, and federal agencies. More recently, citizen/community science (CS) monitoring efforts are providing site-specific data. One of the biggest challenges in using these government datasets, let alone incorporating CS data, for a holistic assessment of environmental exposure is data management and interoperability. To facilitate a more holistic perspective and approach to solution generation, we have developed a method to provide a common data model that will allow environmental health researchers working at different scales and research domains to exchange data and ask new questions. We anticipate that this method will help to address environmental health disparities, which are unjust and avoidable, while ensuring CS datasets are ethically integrated to achieve environmental justice. Specifically, we used a transdisciplinary research framework to develop a methodology to integrate CS data with existing governmental environmental monitoring and social attribute data (vulnerability and resilience variables) that span across 10 different federal and state agencies. A key challenge in integrating such different datasets is the lack of widely adopted ontologies for vulnerability and resiliency factors. In addition to following the best practice of submitting new term requests to existing ontologies to fill gaps, we have also created an application ontology, the Superfund Research Project Data Interface Ontology (SRPDIO)

    Genetic polymorphisms of the GNRH1 and GNRHR genes and risk of breast cancer in the National Cancer Institute Breast and Prostate Cancer Cohort Consortium (BPC3)

    Get PDF
    RIGHTS : This article is licensed under the BioMed Central licence at http://www.biomedcentral.com/about/license which is similar to the 'Creative Commons Attribution Licence'. In brief you may : copy, distribute, and display the work; make derivative works; or make commercial use of the work - under the following conditions: the original author must be given credit; for any reuse or distribution, it must be made clear to others what the license terms of this work are.Abstract Background Gonadotropin releasing hormone (GNRH1) triggers the release of follicle stimulating hormone and luteinizing hormone from the pituitary. Genetic variants in the gene encoding GNRH1 or its receptor may influence breast cancer risk by modulating production of ovarian steroid hormones. We studied the association between breast cancer risk and polymorphisms in genes that code for GNRH1 and its receptor (GNRHR) in the large National Cancer Institute Breast and Prostate Cancer Cohort Consortium (NCI-BPC3). Methods We sequenced exons of GNRH1 and GNRHR in 95 invasive breast cancer cases. Resulting single nucleotide polymorphisms (SNPs) were genotyped and used to identify haplotype-tagging SNPs (htSNPS) in a panel of 349 healthy women. The htSNPs were genotyped in 5,603 invasive breast cancer cases and 7,480 controls from the Cancer Prevention Study-II (CPS-II), European Prospective Investigation on Cancer and Nutrition (EPIC), Multiethnic Cohort (MEC), Nurses' Health Study (NHS), and Women's Health Study (WHS). Circulating levels of sex steroids (androstenedione, estradiol, estrone and testosterone) were also measured in 4713 study subjects. Results Breast cancer risk was not associated with any polymorphism or haplotype in the GNRH1 and GNRHR genes, nor were there any statistically significant interactions with known breast cancer risk factors. Polymorphisms in these two genes were not strongly associated with circulating hormone levels. Conclusion Common variants of the GNRH1 and GNRHR genes are not associated with risk of invasive breast cancer in Caucasians.Published versio

    Evaluation of polygenic risk scores for breast and ovarian cancer risk prediction in BRCA1 and BRCA2 mutation carriers

    Get PDF
    Background: Genome-wide association studies (GWAS) have identified 94 common single-nucleotide polymorphisms (SNPs) associated with breast cancer (BC) risk and 18 associated with ovarian cancer (OC) risk. Several of these are also associated with risk of BC or OC for women who carry a pathogenic mutation in the high-risk BC and OC genes BRCA1 or BRCA2. The combined effects of these variants on BC or OC risk for BRCA1 and BRCA2 mutation carriers have not yet been assessed while their clinical management could benefit from improved personalized risk estimates. Methods: We constructed polygenic risk scores (PRS) using BC and OC susceptibility SNPs identified through population-based GWAS: for BC (overall, estrogen receptor [ER]-positive, and ER-negative) and for OC. Using data from 15 252 female BRCA1 and 8211 BRCA2 carriers, the association of each PRS with BC or OC risk was evaluated using a weighted cohort approach, with time to diagnosis as the outcome and estimation of the hazard ratios (HRs) per standard deviation increase in the PRS. Results: The PRS for ER-negative BC displayed the strongest association with BC risk in BRCA1 carriers (HR = 1.27, 95% confidence interval [CI] = 1.23 to 1.31, P = 8.2 x 10(53)). In BRCA2 carriers, the strongest association with BC risk was seen for the overall BC PRS (HR = 1.22, 95% CI = 1.17 to 1.28, P = 7.2 x 10(-20)). The OC PRS was strongly associated with OC risk for both BRCA1 and BRCA2 carriers. These translate to differences in absolute risks (more than 10% in each case) between the top and bottom deciles of the PRS distribution; for example, the OC risk was 6% by age 80 years for BRCA2 carriers at the 10th percentile of the OC PRS compared with 19% risk for those at the 90th percentile of PRS. Conclusions: BC and OC PRS are predictive of cancer risk in BRCA1 and BRCA2 carriers. Incorporation of the PRS into risk prediction models has promise to better inform decisions on cancer risk management
    • …
    corecore